923 research outputs found

    Recurrent Latent Variable Networks for Session-Based Recommendation

    Full text link
    In this work, we attempt to ameliorate the impact of data sparsity in the context of session-based recommendation. Specifically, we seek to devise a machine learning mechanism capable of extracting subtle and complex underlying temporal dynamics in the observed session data, so as to inform the recommendation algorithm. To this end, we improve upon systems that utilize deep learning techniques with recurrently connected units; we do so by adopting concepts from the field of Bayesian statistics, namely variational inference. Our proposed approach consists in treating the network recurrent units as stochastic latent variables with a prior distribution imposed over them. On this basis, we proceed to infer corresponding posteriors; these can be used for prediction and recommendation generation, in a way that accounts for the uncertainty in the available sparse training data. To allow for our approach to easily scale to large real-world datasets, we perform inference under an approximate amortized variational inference (AVI) setup, whereby the learned posteriors are parameterized via (conventional) neural networks. We perform an extensive experimental evaluation of our approach using challenging benchmark datasets, and illustrate its superiority over existing state-of-the-art techniques

    Variational Deep Semantic Hashing for Text Documents

    Full text link
    As the amount of textual data has been rapidly increasing over the past decade, efficient similarity search methods have become a crucial component of large-scale information retrieval systems. A popular strategy is to represent original data samples by compact binary codes through hashing. A spectrum of machine learning methods have been utilized, but they often lack expressiveness and flexibility in modeling to learn effective representations. The recent advances of deep learning in a wide range of applications has demonstrated its capability to learn robust and powerful feature representations for complex data. Especially, deep generative models naturally combine the expressiveness of probabilistic generative models with the high capacity of deep neural networks, which is very suitable for text modeling. However, little work has leveraged the recent progress in deep learning for text hashing. In this paper, we propose a series of novel deep document generative models for text hashing. The first proposed model is unsupervised while the second one is supervised by utilizing document labels/tags for hashing. The third model further considers document-specific factors that affect the generation of words. The probabilistic generative formulation of the proposed models provides a principled framework for model extension, uncertainty estimation, simulation, and interpretability. Based on variational inference and reparameterization, the proposed models can be interpreted as encoder-decoder deep neural networks and thus they are capable of learning complex nonlinear distributed representations of the original documents. We conduct a comprehensive set of experiments on four public testbeds. The experimental results have demonstrated the effectiveness of the proposed supervised learning models for text hashing.Comment: 11 pages, 4 figure

    SchNet - a deep learning architecture for molecules and materials

    Get PDF
    Deep learning has led to a paradigm shift in artificial intelligence, including web, text and image search, speech recognition, as well as bioinformatics, with growing impact in chemical physics. Machine learning in general and deep learning in particular is ideally suited for representing quantum-mechanical interactions, enabling to model nonlinear potential-energy surfaces or enhancing the exploration of chemical compound space. Here we present the deep learning architecture SchNet that is specifically designed to model atomistic systems by making use of continuous-filter convolutional layers. We demonstrate the capabilities of SchNet by accurately predicting a range of properties across chemical space for \emph{molecules and materials} where our model learns chemically plausible embeddings of atom types across the periodic table. Finally, we employ SchNet to predict potential-energy surfaces and energy-conserving force fields for molecular dynamics simulations of small molecules and perform an exemplary study of the quantum-mechanical properties of C20_{20}-fullerene that would have been infeasible with regular ab initio molecular dynamics

    Neural Attentive Session-based Recommendation

    Full text link
    Given e-commerce scenarios that user profiles are invisible, session-based recommendation is proposed to generate recommendation results from short sessions. Previous work only considers the user's sequential behavior in the current session, whereas the user's main purpose in the current session is not emphasized. In this paper, we propose a novel neural networks framework, i.e., Neural Attentive Recommendation Machine (NARM), to tackle this problem. Specifically, we explore a hybrid encoder with an attention mechanism to model the user's sequential behavior and capture the user's main purpose in the current session, which are combined as a unified session representation later. We then compute the recommendation scores for each candidate item with a bi-linear matching scheme based on this unified session representation. We train NARM by jointly learning the item and session representations as well as their matchings. We carried out extensive experiments on two benchmark datasets. Our experimental results show that NARM outperforms state-of-the-art baselines on both datasets. Furthermore, we also find that NARM achieves a significant improvement on long sessions, which demonstrates its advantages in modeling the user's sequential behavior and main purpose simultaneously.Comment: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management. arXiv admin note: text overlap with arXiv:1511.06939, arXiv:1606.08117 by other author

    Isolated Character Forms from Dated Syriac Manuscripts

    Get PDF
    This paper describes a set of hand-isolated character samples selected from securely dated manuscripts written in Syriac between 300 and 1300 C.E., which are being made available for research purposes. The collection can be used for a number of applications, including ground truth for character segmentation and form analysis for paleographical dating. Several applications based upon convolutional neural networks demonstrate the possibilities of the data set

    Neural Network Ambient Occlusion

    Get PDF

    Community detection‐based deep neural network architectures: A fully automated framework based on Likert‐scale data.

    Get PDF
    Deep neural networks (DNNs) have emerged as a state‐of‐the‐art tool in very different research fields due to its adaptive power to the decision space since they do not presuppose any linear relationship between data. Some of the main disadvantages of these trending models are that the choice of the network underlying architecture profoundly influences the performance of the model and that the architecture design requires prior knowledge of the field of study. The use of questionnaires is hugely extended in social/behavioral sciences. The main contribution of this work is to automate the process of a DNN architecture design by using an agglomerative hierarchical algorithm that mimics the conceptual structure of such surveys. Although the train had regression purposes, it is easily convertible to deal with classification tasks. Our proposed methodology will be tested with a database containing socio‐demographic data and the responses to five psychometric Likert scales related to the prediction of happiness. These scales have been already used to design a DNN architecture based on the subdimension of the scales. We show that our new network configurations outperform the previous existing DNN architectures

    User Intent Prediction in Information-seeking Conversations

    Full text link
    Conversational assistants are being progressively adopted by the general population. However, they are not capable of handling complicated information-seeking tasks that involve multiple turns of information exchange. Due to the limited communication bandwidth in conversational search, it is important for conversational assistants to accurately detect and predict user intent in information-seeking conversations. In this paper, we investigate two aspects of user intent prediction in an information-seeking setting. First, we extract features based on the content, structural, and sentiment characteristics of a given utterance, and use classic machine learning methods to perform user intent prediction. We then conduct an in-depth feature importance analysis to identify key features in this prediction task. We find that structural features contribute most to the prediction performance. Given this finding, we construct neural classifiers to incorporate context information and achieve better performance without feature engineering. Our findings can provide insights into the important factors and effective methods of user intent prediction in information-seeking conversations.Comment: Accepted to CHIIR 201

    The Value of Paratracheal Lymphadenectomy in Esophagectomy for Adenocarcinoma of the Esophagus or Gastroesophageal Junction: A Systematic Review of the Literature

    Get PDF
    Esophageal adenocarcinoma; Nodal metastases; Upper mediastinal lymphadenectomyAdenocarcinoma esofágico; Metástasis ganglionares; Linfadenectomía del mediastino superiorAdenocarcinoma esofàgic; Metàstasis ganglionars; Limfadenectomia del mediastí superiorBackground The role of upper mediastinal lymphadenectomy for distal esophageal or gastroesophageal junction (GEJ) adenocarcinomas remains a matter of debate. This systematic review aims to provide a comprehensive overview of evidence on the incidence of nodal metastases in the upper mediastinum following transthoracic esophagectomy for distal esophageal or GEJ adenocarcinoma. Methods A literature search was performed using Medline, Embase and Cochrane databases up to November 2020 to include studies on patients who underwent transthoracic esophagectomy with upper mediastinal lymphadenectomy for distal esophageal and/or GEJ adenocarcinoma. The primary endpoint was the incidence of metastatic nodes in the upper mediastinum based on pathological examination. Secondary endpoints were the definition of upper mediastinal lymphadenectomy, recurrent laryngeal nerve (RLN) palsy rate and survival. Results A total of 17 studies were included and the sample sizes ranged from 10-634 patients. Overall, the median incidence of upper mediastinal lymph node metastases was 10.0% (IQR 4.7-16.7). The incidences of upper mediastinal lymph node metastases were 8.3% in the 7 studies that included patients undergoing primary resection (IQR 2.0-16.6), 4,4% in the 1 study that provided neoadjuvant therapy to the full cohort, and 10.6% in the 9 studies that included patients undergoing esophagectomy either with or without neoadjuvant therapy (IQR 8.9-15.8%). Data on survival and RLN palsy rates were scarce and inconclusive. Conclusions The incidence of upper mediastinal lymph node metastases in distal esophageal adenocarcinoma is up to 10%. Morbidity should be weighed against potential impact on survival

    Transformation Pathways of Silica under High Pressure

    Full text link
    Concurrent molecular dynamics simulations and ab initio calculations show that densification of silica under pressure follows a ubiquitous two-stage mechanism. First, anions form a close-packed sub-lattice, governed by the strong repulsion between them. Next, cations redistribute onto the interstices. In cristobalite silica, the first stage is manifest by the formation of a metastable phase, which was observed experimentally a decade ago, but never indexed due to ambiguous diffraction patterns. Our simulations conclusively reveal its structure and its role in the densification of silica.Comment: 14 pages, 4 figure
    corecore